First Digit Distribution in Some Biological Data Sets. Possible Explanations for Departures from Benford's Law
Abstract
Aim: To explore whether data sets of biological origin obey the first digit law (FDL, also known as Benford's law). Materials and Methods: Data were collected from several sources, including gene length data for bacteria, pre-vaccination measles incidence data, and absolute values from human magnetoencephalography (MEG) recordings. First digit frequencies were computed and compared with the FDL predictions. Simulations included a simple model of two-dimensional epidemic spread and a randomly set upper bound model intended to explain the behaviour of the MEG data. Results: The FDL is obeyed by epidemic data reported at a putative focus of spread (pre-vaccination measles incidence for Preston, England). However, peculiar departures were observed for the gene length distribution in microorganisms, for magnetoencephalograms (MEG), and for epidemic data pooled from large geographical regions. Conclusions: Simulation studies revealed that averaging data in a scenario of propagating waves can explain some of the observed departures from the FDL, which could help to understand the behaviour of epidemic data. A randomly set upper bound model (RUBM) can likely explain the observed behaviour of the MEG data. Explaining the behaviour of the gene length data requires further theoretical work.
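The comparison described in the abstract (observed first digit frequencies against the FDL predictions) can be illustrated with a minimal Python sketch. It is not the authors' code: the function names `benford_expected`, `first_digit`, and `first_digit_frequencies` are hypothetical, and the powers of 2 serve only as a stand-in data set that is known to follow Benford's law closely; none of the biological data sets from the study are reproduced here.

```python
import math
from collections import Counter


def benford_expected():
    """Benford's law: P(d) = log10(1 + 1/d) for first digits d = 1..9."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def first_digit(x):
    """Return the first significant digit of a nonzero finite number.

    The absolute value is used, mirroring the abstract's use of absolute
    values for the MEG recordings.
    """
    x = abs(x)
    if x == 0 or not math.isfinite(x):
        raise ValueError("first digit is undefined for zero or non-finite values")
    # In scientific notation the first character is the leading digit,
    # e.g. 871 -> '8.71...e+02' and 0.22 -> '2.2...e-01'.
    return int(f"{x:.15e}"[0])


def first_digit_frequencies(values):
    """Relative frequency of each first digit 1..9, skipping zeros."""
    counts = Counter(first_digit(v) for v in values if v != 0)
    total = sum(counts.values())
    return {d: counts.get(d, 0) / total for d in range(1, 10)}


# Illustrative data only (an assumption, not from the paper): powers of 2.
sample = [2 ** n for n in range(1000)]
observed = first_digit_frequencies(sample)
expected = benford_expected()

for d in range(1, 10):
    print(f"digit {d}: observed {observed[d]:.3f}, expected {expected[d]:.3f}")
```

A formal assessment would add a goodness-of-fit test on top of these frequencies; the sketch stops at the side-by-side comparison.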
Similar resources
Detecting Fraud in Bankrupt Municipalities Using Benford's Law
Acknowledgements I would like to thank Professor Flynn, one of my thesis readers, for assisting me in developing and completing this project. His guidance and unrelenting advice helped make this possible. I would like to express my appreciation and gratitude to Professor Massoud, also one of my thesis readers, for introducing me to the accounting field. Thank you for guiding and supporting me t...
Assessing Conformance with Benford’s Law: Goodness-Of-Fit Tests and Simultaneous Confidence Intervals
Benford's Law is a probability distribution for the first significant digits of numbers, for example, the first significant digits of the numbers 871 and 0.22 are 8 and 2 respectively. The law is particularly remarkable because many types of data are considered to be consistent with Benford's Law and scientists and investigators have applied it in diverse areas, for example, diagnostic tests fo...
The first digit frequencies of primes and Riemann zeta zeros tend to uniformity following a size - dependent
Prime numbers seem to distribute among the natural numbers with no other law than that of chance, however its global distribution presents a quite remarkable smoothness. Such interplay between randomness and regularity has motivated scientists of all ages to search for local and global patterns in this distribution that eventually could shed light into the ultimate nature of primes. In this wor...
Using the Benford’s Law as a First Step to Assess the Quality of the Cancer Registry Data
BACKGROUND Benford's law states that the distribution of the first digit different from 0 [first significant digit (FSD)] in many collections of numbers is not uniform. The aim of this study is to evaluate whether population-based cancer incidence rates follow Benford's law, and if this can be used in their data quality check process. METHODS We sampled 43 population-based cancer registry pop...
Application of Benford’s Law in Analyzing Geotechnical Data
Benford’s law predicts the frequency of the first digit of numbers met in a wide range of naturally occurring phenomena. In data sets, following Benford’s law, numbers are started with a small leading digit more often than those with a large leading digit. This law can be used as a tool for detecting fraud and abnormally in the number sets and any fabricated number sets. This can be used as an ...
Publication date: 2008